Service node, network, and method for pre-fetching for remote program installation

ABSTRACT

A system for package pre-fetching for a remote program installation includes a service node having a processor, a computing node type database, and a cache, the service node being configured to receive at least one package request for a package required for an installation of an operating system and at least one peripheral application thereof from a computing node, and determine a package request sequence by which the computing node issues the at least one package request according to a type of the computing node. In another embodiment, a method includes receiving a package request from a computing node, and determining a package request sequence by which the computing node issues at least one package request according to a type of the computing node, so as to pre-read a subsequent package into a cache before the computing node issues a request for the subsequent package.

RELATED APPLICATIONS

This application is a continuation of copending U.S. patent application Ser. No. 12/277,937, filed Nov. 25. 2008, which claims priority to Chinese Patent Application No. 200710196066.2, filed Nov. 30, 2007, from all of which priority is claimed and which are herein incorporated by reference.

BACKGROUND

The present invention relates to a service node, network, and method for pre-fetching for remote program installation, and more particularly, relates to improving remote installation performance of operating systems and at least one peripheral application in a computer cluster environment.

Known tools help administrators remotely install operating systems and at least one peripheral application for respective client computers. A known tool includes a server-client architecture, as shown in prior art FIG. 1. A service node 10 stores software packages required for installing operating systems and at least one peripheral application associated with the operating system, to a computing node 12. Each computing node 12 runs a program installation unit 14, when the computing node 12 needs to install an operating system and at least one peripheral application of the operating system. The program installation unit 14 sequentially issues requests according to a predetermined order to the service node 10 for software packages required for installing the operating system and at least one peripheral application. The service node 10 sequentially sends software packages to the computing node 12 based on the sequential requests.

The computing node 12 may be classified according to its intended purpose. The computing node 12 may include computing nodes for scientific computation, business analysis, and statistics, for example. The operating system and at least one peripheral application needed for installation may be similar for each type of computing node 12. Therefore, while there are different types of computing nodes, the software packages which need to be requested for installing operating systems and their peripheral applications may be similar.

Software packages may be similar for each type of computing node request sequence, thus there may be request sequence similarity, because the installation of software packages needs to be done based on the installation of other software packages, The request sequence is recorded in the program installation unit 14. The computing node 12 runs the program installation unit 14, when a computing node 12 issues a package request. The program installation unit 14 sequentially issues package requests to the service node 10 according to a fixed order.

In a large cluster, response speed of a service node 10 is critical. Response speed is critical, since faster response speed of the service node 10, means reduced operating system and peripheral application installation time.

In a service node 10, a service program unit 16 sends a required software package to a computing node 12 in response to a request from the program installation unit 14 of the computing node 12. A file system cache 18 may be used in the service node 10. The service program unit 16 first searches in the cache 18, hen the unit 16 needs a package to send to the computing node 12. The cache 18 improves package reading performance of the service node 10, since a package can be read from the cache 18 faster than reading the package from an external storage device 20, for example. Thus, the service program unit 16 can respond to a package request from a computing node 12 at a higher speed using a cache 18, than without a cache.

Only packages that have been read previously can be found in the cache 18. The package must first be read from the external storage device 20 into the cache 18 by the service program unit 16. The service program unit 16 can then read out from the cache 18 when a package is read for the first time.

Packages in the cache 18 may also overflow. Due to the limited size of the cache 18, some algorithms may need to shift some packages or files out from the cache 18. Therefore, when the packages that have been previously recorded in the cache 18 are read again, it is possible packages have overflowed. Therefore, the service program unit 16 has to re-read the packages from the external storage device 20.

BRIEF SUMMARY

In one embodiment, a system for package pre-fetching for remote program installation includes a service node having a processor, a computing node type database, and a cache, the service node being configured to receive at least one package request for a package required for an installation of an operating system and at least one peripheral application thereof from a computing node, and determine a package request sequence by which the computing node issues the at least one package request according to a type of the computing node.

In another embodiment, a method for implementing package pre-fetching for remote program installation includes receiving a package request for a package required for an installation of an operating system and at least one peripheral application thereof from a computing node, and determining a package request sequence by which the computing node issues at least one package request according to a type of the computing node, so as to pre-read a subsequent package into a cache before the computing node issues a request for the subsequent package.

Various other features, exemplary features, and attendant advantages of the present disclosure will become more fully appreciated as the same becomes better understood when considered in conjunction with the accompanying drawings, in which like reference characters designate the same or similar parts throughout the several views.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

The figures form a part of the specification and are used to describe the embodiments of the invention and explain the principle of the invention together with the literal statement. The foregoing and other objects, aspects, and advantages will be better understood from the following non-limiting detailed description of preferred embodiments of the invention with reference to the drawings, wherein:

FIG. 1 is a schematic view of a prior art network architecture;

FIG. 2 is a view of network architecture according to one embodiment of the invention;

FIG. 3 is a schematic view of a flow diagram for reading packages from an external storage device into a cache in the network architecture according to one embodiment of the invention;

FIG. 4 is a schematic view of a flow diagram for pre-storing a package request sequence into a request sequence database in a network architecture according to one embodiment of the invention; and

FIG. 5 is a flow chart of a method for implementing package pre-fetching for remote program installation according to one embodiment of the invention.

DETAILED DESCRIPTION

The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself, however, as well as a preferred mode of use, further objectives and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying drawings.

FIG. 2 shows one embodiment of a network architecture. The network includes a service node 10 and computing nodes 12 a; 12 b, 12 c. While three computing nodes are illustrated, it is to be understood that there may be from 1 to “n” number of computing nodes 12, where “n” equals any desired number of computing nodes 12.

A computing node 12 may include an program installation unit 14 that sequentially issues at least one package request that is required for installing an operating system and at least one peripheral application associated with the operating system to the service node 10 according to a fixed request sequence. The service node 10 may include a service program unit 16, a cache 18, a computing node type database 22, and a request sequence database 24. An external storage device 20 for storing data may be coupled to service node 12. The external storage device 20 may be provided to store data, such as operating systems and their peripheral applications, which may be installed on the computing nodes 12.

The computing node type database 22 stores identification information for each computing node 12 requesting packages. The identification information for each computing node 12 may include an address, computing node type, and other identification information. The request sequence database 24 is provided for storing a package request sequence corresponding to each computing node type of the operating system and peripheral application to be installed on a computing node 12.

In one embodiment, the service program unit 16 is provided to determine the identification information, such as the address and type of computing node, of the computing node 12 issuing the package request. The service program unit 16 receives a package request from a computing node 12 and then searches the computing node type database 22 for the computing node type corresponding to the address of the computing node 12 issuing the package request. The service program unit 16 then searches the request sequence database 24 for the package request sequence corresponding to the computing node type. The service program unit 16 then pre-reads a subsequent package into the cache 18 from the external storage device 20 before the computing node 12 issues a package request for the subsequent package. In another embodiment, reading packages from the external storage device 20 is not required when the package request sequence associated with the type of the computing node 12 is already stored in the request sequence database 24. As a result, service node 10 response times are reduced substantially.

In one embodiment, the service node 10 reads a subsequent package into the cache 18 while the package is read from the external storage device 20, and while simultaneously sending the requested package, in sequence, to the computing node 12. The service node 10 reads a subsequent package in sequence into cache 18 from the external storage device 20, after receiving the package request from the computing node 12. In another embodiment, the service program unit 16 reads all packages in, other than the first package request in sequence into cache 18 from the external storage device 20 after having received the first package request and package request sequence from the computing node 12.

FIG. 3 shows a flow diagram of a network architecture and FIG. 5 shows a flow chart of an embodiment of a method. Referring to FIG. 3 and FIG. 5, in the network architecture, software packages are read into a cache 18 from an external storage device 20. In the example shown in the Figures, the software packages that are required for a computing node 12 to install an operating system, and at least one peripheral application associated with the operating system, include package Z, package A and package B. The program installation unit 14 issues a package request to the service program unit 16 in the sequence of package Z, package A, and package B.

The service program unit 16 first receives a request for package Z from a computing node 12, shown as step S1 in FIG. 5. The service program unit 16 determines the identification information, such as the address and type of computing node, of the computing node 12 issuing the request for package Z. The service program unit 16 then searches the computing node type database 22 for the computing node type corresponding to the address of the computing node 12 issuing the package request. The service program unit 16 then searches the request sequence database 24 for the package request sequence corresponding to the computing node type, shown as step S2 in FIG. 5, where it is determined that the package request sequence is package Z, package A, and then package B.

The service program unit 16 then pre-reads a subsequent package into the cache 18 from the external storage device 20 before the computing node 12 issues a package request for the subsequent package, shown as step S3 in FIG. 5. Upon receipt of the request for package Z, the service program unit 16 pre-reads package A into the cache 18 from the external storage device 20.

The service program unit 16 then sends package Z and package A simultaneously to the computing node 12. The service program unit 16 then receives a request for package A from computing node 12. The service program unit 16 then reads package B into cache 18 from the external storage device 20. The service program unit 16 searches the cache 18 for the package A, locates package A, and sends package A to the computing node 12. Computing node 12 then sends a request to the service program unit 16 for package B. Again, the service program unit 16 searches the cache 18 for the package locates it, and then sends package B to the computing node 12.

In another embodiment, the service program unit 16 first receives a request for package Z from a computing node 12. The service program unit 16 determines the identification information, such as the address and type of computing node, of the computing node 12 issuing the request for package Z. The service program unit 16 then searches the computing node type database 22 for the computing node type corresponding to the address of the computing node 12 issuing the package request. The service program unit 16 then searches the request sequence database 24 for the package request sequence corresponding to the computing node type, shown as step S2 in FIG. 5, where it is determined that the package request sequence is package Z, package A, and then package B. Upon receipt of the request for package Z, the service program unit 16 simultaneously reads package A and package B into the cache 18 from the external storage unit 20. Reading package B into cache 18 from the external storage device 20 may not be required if the computing node 12 sends a request to the service program unit 16.

In another embodiment, packages are not read from the external storage device 20, if the computing node 12 sends a package request to the service program unit 16. In this embodiment, the service program unit 16 reads packages from the cache 18 and not from the external storage device 20. The response time to service node 10 is substantially reduced by reading packages to the computing node 12 directly from the cache 18. The request sequence database 24 stores the package request sequence corresponding to each computing node type.

FIG. 4 shows a flow diagram of a network architecture for pre-storing a package request sequence in a request sequence database 24 according to one embodiment of the invention. In the embodiment, the package request sequence is recorded when a computing node 12 first receives all requested packages in the request sequence database 24.

The service program unit 16 identifies the corresponding computing node type issuing the package request, The service program unit 16 then queries the request sequence database 24 to determine if the package request sequence corresponding to the identified computing node type is stored in the database 24, when receiving a request for package Z from computing node 12. The service program 16 may indicate if the computing node 12 is not the first requesting a package in that type of computing nodes, The request sequence will not be recorded if the package request sequence corresponding to the computing node type is stored in the database 24.

The service program unit 16 may record the name of package Z in the request sequence database 24, if the package request sequence corresponding to the computing node type is not stored in the request sequence database 24. Computing node 12 receives package Z after the service program unit 16 reads package Z. The service program unit 16 then receives a request for package A from computing node 12. The service program unit 16 records the name of package A sequentially after the name of package Z, in the request sequence database 24. The service program unit 16 then reads package A and sends it to computing node 12.

Computing node 12 receives package A after the service program unit 16 reads package A. The service program unit 16 records the name of package B sequentially after the name of package A in the request sequence database 24. The service program unit 16 then reads package B and sends it to the computing node 12, The computing node 12 receives package B after the service program unit 16 reads package B.

In another embodiment, the package request sequence is recorded when a computing node 12, for each type of computing node, first acquires all requested packages in the request sequence database 24. The computing node type corresponding to the computing node 12 requesting the packages is identified by the service program unit 16, after the unit 16 receives a package request.

The service program unit 16 then determines whether the package request sequence of the computing node type is stored in the request sequence database 24. The request sequence by which the computing node 12 requests packages is not recorded if the package request sequence of that computing node type is already stored in the request sequence database 24. The package requests from the computing node 12 are sequentially received, if the package request sequence of the computing node type is not stored in the request sequence database 24. The request sequence is recorded in the request sequence database 24 after all the package requests are received if the request sequence database 24 has not yet recorded the request sequence.

In another in another embodiment, the request sequence database 24 can be disabled and the package sequence corresponding to the computing node type can be received using installation configuration files. The service program unit 16 stores the installation configuration files. The installation configuration files comprise a software package and a package request sequence for each computing node 12 installing operating systems and their associated peripheral applications. The request sequence by which the respective computing node 12 requests packages can be obtained if the installation configuration files are analyzed.

Those skilled in the art will appreciate that various adaptations and modifications of the just-described preferred embodiments can be configured without departing from the scope and spirit of the invention. Therefore, it is to be understood that, within the scope of the appended claims, the invention may be practiced other than as specifically described herein. 

What is claimed is:
 1. A system for package pre-fetching for remote program installation, the system comprising: a service node comprising a processor, a computing node type database, and a cache, the service node being configured to: receive at least one package request for a package required for an installation of an operating system and at least one peripheral application thereof from a computing node; and determine a package request sequence by which the computing node issues the at least one package request according to a type of the computing node.
 2. The system as recited in claim 1, wherein the service node is further configured to pre-read a subsequent package from an external storage device into the cache prior to the computing node issuing a request for the subsequent package.
 3. The system as recited in claim 1, wherein the service node is further configured to store identification information of each computing node in a computing node type database in an association with a computing node type.
 4. The system as recited in claim 3, wherein the identification information is an address.
 5. The system as recited in claim 3, wherein the service node is further configured to search for a corresponding computing node type in the computing node type database in order to determine the package request sequence.
 6. The system as recited in claim 5, wherein the corresponding computing node type is determined by the identification information stored in the computing node type database.
 7. The system as recited in claim 1, wherein the service node is further configured to construct a request sequence database based on one or more previous package requests from a computing node, the request sequence database being configured to store a plurality of package request sequences corresponding to each type of computing node.
 8. The system as recited in claim 1, wherein the service node further comprises a request sequence database for storing a package request sequence corresponding to each type of computing node.
 9. The system as recited in claim 1, further comprising the computing node, wherein the computing node is configured to issue the package request for installing the operating system and the at least one peripheral application associated with the operating system, based on an internal installation configuration file.
 10. The system as recited in claim 1, further comprising an external storage device coupled to the cache, wherein the service node is further configured to read a subsequent package, following a requested package from the external storage device, into the cache while returning the requested package to the computing node.
 11. The system as recited in claim 10, wherein the service node is further configured to read all of the request packages other than a first package in the package request sequence from the external storage device into the cache at one time after receiving a first package request from the computing node and acquiring the package request sequence corresponding to the computing node type.
 12. The system as recited in claim 1, wherein the service node is further configured to: search the cache to determine whether the requested package is in the cache after receiving a package request from the computing node; read the requested package from an external storage device into the cache; and read the requested package from the cache.
 13. A method for implementing package pre-fetching for remote program installation, the method comprising: receiving a package request for a package required for an installation of an operating system and at least one peripheral application thereof from a computing node; and determining a package request sequence by which the computing node issues at least one package request according to a type of the computing node, so as to pre-read a subsequent package into a cache before the computing node issues a request for the subsequent package.
 14. The method as recited in claim 13, further comprising: storing identification information of each computing node in a computing node type database in an association with a computing node type; determining identification information of the computing node issuing the package request; and searching for a corresponding computing node type in the computing node type database, determined by the identification information stored in the computing node type database.
 15. The method as recited in claim 13, further comprising pre-reading a subsequent package from an external storage device into the cache prior to the computing node issuing a request for the subsequent package.
 16. The method as recited in claim 13, further comprising: storing identification information of each computing node in a computing node type database in an association with a computing node type, wherein the identification information comprises an address; and searching for a corresponding computing node type in the computing node type database in order to determine the package request sequence, wherein the corresponding computing node type is determined by the identification information stored in the computing node type database.
 17. The method as recited in claim 13, further comprising constructing a request sequence database based on one or more previous package requests from a computing node, the request sequence database being configured to store a plurality of package request sequences corresponding to each type of computing node.
 18. The method as recited in claim 13, further comprising reading a subsequent package into the cache, following a requested package from an external storage device coupled to the cache, while returning the requested package to the computing node.
 19. The method as recited in claim 18, further comprising reading all of the request packages other than a first package in the package request sequence from the external storage device into the cache at one time after receiving a first package request from the computing node and acquiring the package request sequence corresponding to the computing node type.
 20. The method as recited in claim 13, further comprising: searching the cache to determine whether the requested package is in the cache after receiving a package request from the computing node; reading the requested package from the external storage device into the cache; and reading the requested package from the cache. 